04:00
2026-06-14
lesswrong.com
machine-learning
Speeding Up JumpReLU SAE Inference with Custom Triton Kernels (2β14Γ on Real SAEs)
Researchers developed custom Triton kernels that accelerate JumpReLU Sparse Autoencoder inference by 2β14Γ on real SAEs, exploiting activation sparsity to skip zero entries during matrix multiplicatioβ¦